Unsupervised feature learning on monaural DOA estimation using convolutional deep belief networks
نویسندگان
چکیده
In recent years, deep learning approaches have gained significant interest as a way of building hierarchical representations from unlabeled data. Additionally, in the field of sound direction-of-arrival (DOA) estimation, the binaural features like interaural time or phase difference and interaural level difference, or monaural cues like spectral peaks and notches are often used to estimate sound DOA. Although these binaural or monaural cues successfully explained human sound DOA ability, its accuracy and extent are all limited to the human knowledge on feature extracting methods. In this paper, we are interested in applying a deep learning approach to monaural sound localization based on monaural spectral cues. A convolutional deep belief network is applied to monaural auditory spectrograms processed by a computational auditory model to learning monaural features automatically. The learned features are then regressed using a support vector regression (SVR) model for sound DOA estimation tasks. Moreover, our feature representations learned from unlabeled monaural auditory spectrograms are then compared to the well-known binaural features. The results indicate that our monaural model shows reasonable performance at the task of 3D DOA estimation.
منابع مشابه
Unsupervised feature learning for audio classification using convolutional deep belief networks
In recent years, deep learning approaches have gained significant interest as a way of building hierarchical representations from unlabeled data. However, to our knowledge, these deep learning approaches have not been extensively studied for auditory data. In this paper, we apply convolutional deep belief networks to audio data and empirically evaluate them on various audio classification tasks...
متن کاملKeyword Spotting with Convolutional Deep Belief Networks and Dynamic Time Warping
To spot keywords on handwritten documents, we present a hybrid keyword spotting system, based on features extracted with Convolutional Deep Belief Networks and using Dynamic Time Warping for word scoring. Features are learned from word images, in an unsupervised manner, using a sliding window to extract horizontal patches. For two single writer historical data sets, it is shown that the propose...
متن کاملEstimation of Hand Skeletal Postures by Using Deep Convolutional Neural Networks
Hand posture estimation attracts researchers because of its many applications. Hand posture recognition systems simulate the hand postures by using mathematical algorithms. Convolutional neural networks have provided the best results in the hand posture recognition so far. In this paper, we propose a new method to estimate the hand skeletal posture by using deep convolutional neural networks. T...
متن کاملMeasuring Invariances in Deep Networks
For many pattern recognition tasks, the ideal input feature would be invariant to multiple confounding properties (such as illumination and viewing angle, in computer vision applications). Recently, deep architectures trained in an unsupervised manner have been proposed as an automatic method for extracting useful features. However, it is difficult to evaluate the learned features by any means ...
متن کاملRegularization for Unsupervised Deep Neural Nets
Unsupervised neural networks, such as restricted Boltzmann machines (RBMs) and deep belief networks (DBNs), are powerful tools for feature selection and pattern recognition tasks. We demonstrate that overfitting occurs in such models just as in deep feedforward neural networks, and discuss possible regularization methods to reduce overfitting. We also propose a “partial” approach to improve the...
متن کامل